Voice Driven Emotion Recognizer Mobile Phone: Proposal and Evaluations

نویسندگان

  • Aishah Abdul Razak
  • Mohamad Izani Zainal Abidin
  • Ryoichi Komiya
چکیده

This article proposes an application of emotion recognizer system in telecommunications entitled voice driven emotion recognizer mobile phone (VDERM). The design implements a voice-to-image conversion scheme through a voice-to-image converter that extracts emotion features in the voice, recognizes them, and selects the corresponding facial expression images from image bank. Since it only requires audio transmission, it can support video communication at a much lower bit rate than the conventional videophone. The first prototype of VDERM system has been implemented into a personal computer. The coder, voice-to-image converter, image database, and system interface are preinstalled in the personal computer. In this article, we present and discuss some evaluations that have been conducted in supporting this proposed prototype. The results have shown that both voice and image are important for people to correctly recognize emotion in telecommunications and the proposed solution can provide an alternative to videophone systems. The future works list some modifications that can be done to the proposed prototype in order to make it more practical for mobile applications. INTRODUCTION AND MOTIVATION Nonverbal communication plays a very important role in human communications (Komiya, Mohd Arif, Ramliy, Gowri, & Mokhtar, 1999). However,

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Multi-Dialectical Languages Effect on Speech Recognition

Research has shown that automatic speech recognition (ASR) performance typically decreases when evaluated on a dialectal variation of the same language that was not used for training its models. Similarly, models simultaneously trained on a group of dialects tend to underperform when compared to dialect-specific models. When trying to decide which dialect-specific model (recognizer) to use to d...

متن کامل

Speech spotter: on-demand speech recognition in human-human conversation on the telephone or in face-to-face situations

This paper describes a novel speech-interface function, called “speech spotter”, which enables a user to enter voice commands into a speech recognizer in the midst of natural human-human conversation. In the past, it has been difficult to use automatic speech recognition in human-human conversation since it was not easy to judge, from only microphone input, whether a user was speaking to anothe...

متن کامل

Speech Spotter: On-demand Speech Recognition in Human-Human Conversation on the Telephone or in Face-to-Face Situations / Masataka Goto

This paper describes a novel speech-interface function, called “speech spotter”, which enables a user to enter voice commands into a speech recognizer in the midst of natural human-human conversation. In the past, it has been difficult to use automatic speech recognition in human-human conversation since it was not easy to judge, from only microphone input, whether a user was speaking to anothe...

متن کامل

Advances in Information Technology Applications

i For the second time, the guest editor has been given an opportunity by IJITWE to publish selected articles from iiWAS and MoMM conference series. This issue contains five articles from the two conferences held in Yogyakarta, In the first article, Hoang, Nguyen, and Tjoa propose a new approach for assisting users in formulating queries for information retrieval. The approach uses a Semantic We...

متن کامل

AMMON: A Speech Analysis Library for Analyzing Affect, Stress, and Mental Health on Mobile Phones

The human voice encodes a wealth of information about emotion, mood and mental state. With mobile phones this information is potentially available to a host of applications. In this paper we describe the AMMON (Affective and Mentalhealth MONitor) library, a low footprint C library designed for widely available phones. The library incorporates both core features for emotion recognition (from the...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • IJITWE

دوره 3  شماره 

صفحات  -

تاریخ انتشار 2008